Extracting Size and Shape Information of Sound Source in an Optimal Auditory Processing Model

نویسندگان

Toshio Irino

Roy D. Patterson

چکیده

We hear phonemes pronounced by men, women and children as approximately the same although the length of the vocal tract varies considerably from group to group. At the same time, we can identify the speaker group. This suggests that we extract and separate the size and shape information of sound sources. The impulse response of the vocal tract is compressed or expanded in time when the length of the vocal tract is compressed or expanded proportionally with the same cross-area function. The compressed and dilated versions of the impulse response can be converted into the same distribution using the Mellin transform. In this paper we show that the Mellin transform can be applied to the stabilised wavelet transform that forms the basis of the Auditory Image Model (AIM) of processing in the auditory pathway. The combined processing normalizes source size information and produces a new, fruitful representation of source shape information, referred to as the “Mellin Image.” This “Stabilised Wavelet-Mellin Transform” (SWMT) also provides the mathematical framework for the derivation of the gammachirp auditory filterbank (Irino and Patterson, 1997).

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Stabilised wavelet mellin transform: an auditory strategy for normalising sound-source size

متن کامل

Selective deficits in human audition: evidence from lesion studies

The human auditory cortex is the gateway to the most powerful and complex communication systems and yet relatively little is known about its functional organization as compared to the visual system. Several lines of evidence, predominantly from recent studies, indicate that sound recognition and sound localization are processed in two at least partially independent networks. Evidence from human...

متن کامل

Sound resynthesis from Auditory Mellin Image using STRAIGHT

We propose an Auditory VOCODER to resynthesize sound from the Auditory Mellin Image which is an auditory representation that segregates the size and shape information of incoming sound. The sound resynthesis part consists of three techniques: the STRAIGHT VOCODER [2], frequency-warping cepstral analysis [4,12], and nonlinear multivariate regression analysis (MRA). We explain these methods and t...

متن کامل

Selective deficits in human audition: evidence from lesion studies

متن کامل

Calculation of the drop in sound pressure level and frequency analysis of aerospace engine test cell (Research Article)

Aerospace engines testing is a source of noise pollution and determining the low frequency acoustic characteristics of the test cell, plays an important role in optimally control of the sound field and reducing the level of sound pressure and pollution. In this study, the drop in average sound pressure level is numerically predicted by constructing a test cell according to ISO 140 standard. To ...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 1999

Extracting Size and Shape Information of Sound Source in an Optimal Auditory Processing Model

نویسندگان

چکیده

منابع مشابه

Stabilised wavelet mellin transform: an auditory strategy for normalising sound-source size

Selective deficits in human audition: evidence from lesion studies

Sound resynthesis from Auditory Mellin Image using STRAIGHT

Selective deficits in human audition: evidence from lesion studies

Calculation of the drop in sound pressure level and frequency analysis of aerospace engine test cell (Research Article)

عنوان ژورنال:

اشتراک گذاری